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Abstract Differences in codon usage can seriously 
hamper the expression of cloned cDNAs in heterologous 
1 systems. In this study, we show that the expression of a 
cloned Schistosoma mansoni cDNA in cultured HEK293 
cells was dramatically increased by rewriting a portion 
of the cDNA according to human preferred codon 
usage, suggesting that codon optimization is a valuable 
strategy for improving the heterologous expression of 
helminth sequences. We further describe a simple mod- 
ification of a recursive PCR-based method, which allows 
the rewriting of long stretches of DNA sequence in a 
single PCR reaction. This method can be used to opti- 
mize the codon usage of virtually any DNA from 
helminths and other parasites. 



The recent growth of helminth genomic and EST (ex- 
pressed sequence tag) sequencing data has increased the 
need for suitable heterologous expression systems in 
which to test the function of cloned DNA sequences. 
Unfortunately, helminth DNAs often express poorly in 
heterologous environments such as Escherichia coli or 
mammalian cells. This is due, in part, to differences in 
codon usage, which can significantly decrease transla- 
tion efficiency and thereby lower protein production in a 
heterologous system (Hernan et al. 1992; Kane 1995). In 
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our laboratory, we have experienced difficulties with 
heterologous expression of a recently cloned Schistoso- 
ma mansoni cDNA (GenBank Accession No. 
AF031196). The cDNA, which encodes a novel 
S. mansoni G protein-coupled receptor (SmGPCR) 
(Hamdan et al. 2002), was found to express at very low 
levels in transiently transfected HEK293 cells. Attempts 
to improve expression by varying transfection condi- 
tions, the expression vector or the expressing cell envi- 
ronment were all unsuccessful. A close examination of 
the DNA sequence of SmGPCR revealed that a signifi- 
cant number of amino acid residues were encoded by 
codons that are rarely represented in mammalian cells 
(Nakamura et al. 1999), in particular, T/CTA (Leu), 
ATA (He) and CGT/A (Arg). Moreover, the analysis 
showed that a majority (ft*70%) of these distinctive 
SmGPCR codons occurred in the first half of the cDNA 
sequence, frequently in clusters of two to three consec- 
utive codons, which can further contribute to low pro- 
tein production in the heterologous environment (Kane 
1995). To address this problem, we devised a strategy 
designed to rewrite the S. mansoni cDNA according to 
mammalian (human) preferred codon usage. Codon 
optimization has been used successfully to improve 
heterologous expression of cDNAs from other species 
(Hernan et al. 1992; Yang et al. 1996; Vervoort et al. 
2000) and a number of different experimental protocols 
have been described, which range from the modification 
of a few codons (Vervoort et al. 2000) to the extensive 
rewriting of up to 1,000 bp of DNA (Hale and 
Thompson 1998). DNA rewriting is typically done by 
recursive (overlap) PCR (Prodromou and Pearl 1992), a 
technique in which the desired sequence is produced in 
vitro from the extension of multiple overlapping oli- 
gonucleotide primers. Existing protocols for rewriting 
long DNA molecules ( >600 bp), which are more diffi- 
cult to synthesize in vitro, require multistep procedures 
of up to three consecutive rounds of recursive PCR 
(Hale and Thompson 1998; Te'o et al. 2000). In the 
present study, we describe a simpler method, which was 
optimized to rewrite the first 855 bp of the 5. mansoni 
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cDNA in a single PCR reaction. The resulls presented 
here show that rewriting the first half of the cDNA was 
sufficient to dramatically increase the expression of full- 
length SmGPCR protein in cultured HEK293. 

The first half of the SmGPCR coding sequence 
(nucleotides 268-1,122; Gen Bank Accession No. 
AF031 196) was synthesized by a one-step recursive PCR 
that contained a total of 16 overlapping mutagenic 
primers and two external sense (PCRF) and antisense 
(PCRR) primers. A schematic representation of the re- 
cursive PCR strategy and a list of relevant primer se- 
quences are shown in Fig. 1. The mutagenic primers 
were designed so as to rewrite the S. mansoni cDNA 
using preferred human codons, as defined in the inter- 
national DNA sequence data base (Nakamura et al. 
1999), without changing the amino acid sequence en- 
coded by SmGPCR. The sequence of each mutagenic 
oligonucleotide was carefully checked to minimize the 
risk of hairpin formation, which is common in the longer 
primers, and to avoid the introduction of cryptic splice 
sites (GTA/TCAG) that may produce truncated prod- 
ucts. Adjacent pairs of mutagenic primers overlapped by 
19—21 bases so as to allow extension of a continuous 
DNA product in the recursive PCR reaction. The two 
sense and antisense external PCR primers (PCRF, 
PCRR, 25-34 mers) were directed towards the begin- 
ning and the end of the codon-optimized SmGPCR se- 
quence, respectively, and were used to amplify the final 
modified product (Fig. 1). The forward external primer 
and the first nested mutagenic primer (IF) were designed 

i F(80)-5« ATGcacritf»»Kry»rfa>ra»iaQjtf; aagcagtacatcaacaagaccagcctgaacagcagcgt 

CATCCCCCATACCCTGAT- 3 ' 

2W7SJ-S' CAGGAACAG«»GATGGTCCACTTGAT<»TAW 
TATOGGGGATGAC- 3 ' 

3K(80)-5' TO^CCATCTCCCTGTrCCTGATCATO^ 
TATCCTOGCCTTCATCAC-3* 

4R(82)-5' GCAGATCAGCCACGGCGAGGGACACGATGTATTGGTCGGTGATTCTCCTCACT 
GTGATGAAGGCCAGGATAAT-3 ' 

5F(85>-5' CTCGCCGTOGCTGATCFGCTGGTCAGCGTGCTGGTGCTGCCTCTGGCTATOtnXM 

CCTGGGATATTGGCCATTCGACA- 3 ' 

6R(72>-5' G<XCATG<*CAGCACGATGTTAGCGCTCA^ 

GCCAATATCC-3' 

7F(5J>-3' CATOGTGCTGTGCATGGCCTCAATTCTGJUVOCTGTGTTGCATTAGCCTGGACA- 3 ' 
8R<78)-S' OMGGTGAACCTGGTCCTCmGTAAAATATTTCATGGGTCT 
CCAGGCTAATGCAACA- 3 ' 

9K(78hS' GAGGACCAGGTTCACCGCCAGCACCATGATOGCCGTCGCX^ 
TCCTGCTCCCCTTCGT-3' 

lOR(84)-S' CCTATAAGCCTTCTTCTAGGTGATCrCGCAAGATCC 

1 IF(83)-5' CCTACAAO\AGGCTTATAG<^TCT 
GATCCCCTACATCTACCTGAGA-3 • 

I IR(W)-5 ' GGCTGCTCAGCTTGA T G I' I GG 'P ^ TCTGCAGGACCTTCAGTCTGTCCTTGATGATCCAGAA 
CACTCTCAGGTACATGTAGGC3GA- 3 » 

I3F(71 )-CCAACAK3UVGCTGAGCAGCCTGAAGAAGCCCAAGTCTt^^ 
CTCCCATC-3' 

14M(56)-5 ' CCTT6AT G 1 1 H CCCaCAOCTGCTGGAGGTTGATGATGATGGGAGCGGGTTTCCTG- 3 ' 

1 SF(«>-5' GTGTGGGAAAACATCAAGGGCAAAATCGGCAAASttSAACATCTT^ 3 ' 

l6R|58)-5*GTGACCACTATATCCACAG<n;(riTCTTAGACTTGCT^ 

PCRF( J4>- 5 ' ATA GCTAfiCCACC ATCGACTACAAGGACGACGAC- 3 ' 

PCRR(25>-S' GTGACCACXA7ATGCACAGCrCTTC-3 ' 

CF(J7>- 5' AACACCTGT CCATATACTCC CCATTTTTTCCATTCTG-3 * 

CR(58)-S' AATGCGGCCGCTTCAATCgTOCTGATGGTCQTO CTTATGCTTGTAGCTCTTAATATTC- 3 ' 



to introduce a Nhel site followed by a Kozak sequence 
(GCCACQ (Kozak 1999) immediately preceding the 
start ATG, and a 24 nucleotide sequence encoding a 
FLAG epitope (DYKDDDDK) (Hopp et al. 1988). The 
reverse primer was designed to introduce a flanking 
Yan9\l site for subsequent ligation of the PCR product 
into an expression construct (Fig. 1). For the recursive 
PCR, the 16 mutagenic primers (each at 0.02 uM) and 
the two external primers (each at 0.8 uM) were com- 
bined in a single reaction containing 2.5 units of a 
proofreading DNA polymerase (PWO), 2 mM MgS0 4 , 



Fig. 1 Construction of a codon-optimized SmGPCR expression 
plasmid. A nucleotide sequence (positions 269-1,122 of SmGPCR 
cDNA; GenBank Accession No. AF03I196), corresponding to 
approximately the first half of the SmGPCR coding sequence 
(Hamdan et al. 2001), was synthesized by recursive PCR, using a 
total of 16 overlapping mutagenic oligonucleotides and two shorter 
external PCR primers (PCRF and PCRR). The oligonucleotides 
were designed to rewrite the cDNA according to human preferred 
codon usage (Nakamura et al. 1999), as described in the text. 
Complete primer sequences are shown (inset). Forward (sense) and 
reverse (antisense) oligonucleotides are indicated by F and R, 
respectively. The positions of start and stop codons are underlined. 
Restriction sites (Nhel, Vart9\l t Notl) are shown in italics. The 
Kozak sequence (GCCACQ, and the FLAG and oligohexahisti- 
dine (6xHis) epitopes are shown in bold. CFand CR are the primers 
used to amplify the wild-type second half of the SmGPCR sequence 
(nucleotides 1,123-1,950 of SmGPCR cDNA). Tlie two codon- 
optimized (hatched box) and wild-type segments were ligated at a 
Van9\\ site. The resulting chimeric cDNA was then ligated into a 
pCEP4 vector at the NhelfNotl sites 
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200 uM of each dNTP and a standard PCR buffer. The 
cycling protocol was exactly as recommended for the 
proofreading PWO DNA polymerase (Roche Molecular 
Biochemicals) at an annealing temperature of 60°C. We 
found that the success of the recursive PCR reaction was 
dependent mainly on the ratio of the external PCR 
primers (PCRF, PCRR) relative to the concentration of 
internal mutagenic oligonucleotides. Four concentra- 
tions of mutagenic primers were tested, 0.005, 0.01, 0.02 
and 0.04 uM. A ratio of 0.8 uM of each external primer 
to 0.02 uM of each mutagenic primer gave the highest 
specificity and yield. 

The remaining half of the SmGPCR coding sequence 
(1,123-1,950; GenBank Accession No. AF031196) was 
amplified in a standard PCR, using a sense primer (CF) 
that introduced a Van9\\ site and overlapped with the 
3'-end of the above codon-optimized sequence (Fig. 1, 
inset). The antisense primer (CR) incorporated a six 
histidine epitope, followed by a stop codon and a single 
adenosine immediately after the stop codon (TGAA). 
The adenosine was added in order to increase transla- 
tional termination efficiency in the mammalian cell en- 
vironment (McCaughan et al. 1995). The reverse primer 
also introduced a Noil restriction site for subsequent 
ligation of the product into an expression vector. To 
obtain a complete SmGPCR, the codon-optimized and 
wild-type PCR products were gel purified and digested 
with Nhe\IVan9\Y and Van9\l/Notl, respectively. Sub- 
sequently, the two fragments were fused together at the 
Van9\l site and then Iigated into Nhel/Notl sites of di- 
gested pCEP4 mammalian expression vector (Invitro- 
gen) (Fig. 1). 

pCEP4 expression constructs carrying partially co- 
don-optimized SmGPCR were transiently transfected 
into HEK293(EBNAl) cells (Invitrogen), using Fu- 
GENE6 (Roche Molecular Biochemicals), according to 
the manufacturer's recommendations. The level of 
expression of SmGPCR in the transfected HEK293 
(EBNAl) cells was monitored by Western blotting and 
in situ immunofluorescence with antibodies that targeted 
the N-terminal FLAG epitope and the C-terminal oli- 
gohistidine epitope. Among the three codon-optimized 
clones tested initially, all were found to react with both 
antibodies, and were thus presumed to be full-length 
(not shown). One of these clones was subsequently 
confirmed by DNA sequencing of the entire insert and 
was selected for further analysis. Figure 2 shows a typ- 
ical in situ immunofluorescence assay of cells transiently 
expressing SmGPCR, using an anti-FLAG (M2) anti- 
body (Sigma) followed by a secondary antibody conju- 
gated to fluorescein. The results revealed significant 
fluorescence in cells transfected with the codon-opti- 
mized form of SmGPCR (Fig. 2A, C). Fluorescence was 
localized mainly in the cell periphery (Fig. 2C), consis- 
tent with the 'S. mansoni receptor being expressed at high 
levels and correctly targeted to the cell membrane. In 
contrast, a parallel experiment using a wild- type 
SmGPCR pCEP4 construct, which also carried an 
N-terminal FLAG epitope, but was not codon-opti- 
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Fig. 2A-D Effect of codon -optimization on SmGPCR expression 
in HEIC293(EBNAl) cells. In situ immunofluorescence analyses: 
Cells were seeded onto coverslips in six-well plates (2-3xl0 5 cells/ 
well), grown to approximately 80% confluency and then transfect- 
ed with l u.g of a pCEP4 expression plasmid containing either 
codon-optimized SmGPCR (A t C) or wild-type SmGPCR (B), each 
fused at the N-terminal end to a FLAG epitope (Hopp et al. 1988). 
About 48 h post-transfection, the cells were fixed on ice with 4% 
paraformaldehyde (Sigma) and then incubated with a monoclonal 
antibody directed against the FLAG epitope (anti-FLAG(M2), 
Sigma; 5 ug/ml] followed by a secondary fluorescein isothtocya- 
nate-conjugated antibody (goat anti-mouse IgG; Sigma; 1:2000 
dilution). Cells were examined by fluorescence microscopy and 
conventional visible light, using a Nikon Optiphot-2 fluorescence 
microscope equipped with Nomarski optics. Transfection efficien- 
cy, measured in parallel experiments, was approximately 40% for 
both populations of cells. C A closer view of a transfected cell 
shows expression of the codon-optimized form of SmGPCR mainly 
in the cell periphery. D Western blot analysis: approximately 
M.SxlO 6 H EK293(EBN A I ) cells were transfected with 3 ug each 
of codon-optimized SmGPCR-FLAG (lane /), wild-type 
SmGPCR-FLAG (lane 2), or pCEP4 plasmid alone without insert 
(lane 5). Cells were lysed 48 h post-transfection and a crude 
28,000 g membrane fraction was prepared from each population of 
cells. Aliquots of the three membrane preparations containing the 
same amount of total protein (20 ug) were resolved on 1 2% sodium 
dodecyl sulfate polyacrylamide gels and subsequently electroblot- 
ted onto nitrocellulose Alters (Novex). Filters were incubated with 
anti-FLAG(M2) antibody (1:1000 dilution) followed by a horse- 
radish peroxidase-conjugated secondary antibody (1:2000 dilution), 
according to standard procedures. Western positive bands were 
detected on film using Lumilight (Roche). The positions of relevant 
protein standards are indicated 

mized, produced very low fluorescence in transiently 
transfected HEK293(EBNA1) cells (Fig. 2B). Similar 
results were obtained from a comparative Western blot 
analysis of the same codon-optimized and wild-type 
receptor species (Fig. 2D). Aliquots of a 28,000 g crude 
membrane fraction containing identical amounts of total 
protein (20 ug) were resolved by sodium dodecyl sulfate 
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polyacrylamide gel electrophoresis and immunoblotted 
with the anti-FLAG (M2) antibody, according to 
standard protocols. A strong immunoreactive band 
corresponding to the SmGPCR monomer (approxi- 
mately 65 kDa) was observed in samples transfected 
with codon-optimized receptor (lane 1) whereas the 
wild-type produced a much weaker, barely detectable 
band (lane 2). No signal was detected in the 28,000 g 
supernatant of either transfection (not shown) or in 
membranes of control cells transfected with pCEP4 
plasmid only (lane 3). Taken together, the results clearly 
demonstrate that the codon-re writing strategy was suc- 
cessful in increasing SmGPCR expression in the mam- 
malian cell. 

In conclusion, this study describes a relatively simple, 
PCR-based co don- optimization procedure that can be 
used to improve the translation efficiency of any cDNA 
in a heterologous environment. It is noteworthy that this 
strategy was successful without having to rewrite the 
entire SmGPCR cDNA sequence, which would have 
complicated the recursive PCR further. Rewriting only 
the first half of SmGPCR, which contained the highest 
proportion of rare codon clusters, was sufficient to in- 
crease protein expression, presumably because the lower 
frequency of rare codons allowed for higher translation 
efficiency in the HEK293 cells. The same strategy could 
be applied to rewrite any other region of a cDNA of 
interest, at least up to 853 bp and, possibly, even longer 
sequences. This method is potentially useful for heter- 
ologous expression of cDNAs from helminths as well as 
other parasites, including protozoans, which have dis- 
tinctive codon preferences. 
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